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Abstract 

Background: Peanut {Arachis hypogaea L.) is a crop of economic and social importance, mainly in tropical areas, 
and developing countries. Its molecular breeding has been hindered by a shortage of polymorphic genetic markers 
due to a very narrow genetic base. Microsatellites (SSRs) are markers of choice in peanut because they are co- 
dominant, highly transferrable between species and easily applicable in the allotetraploid genome. In spite of 
substantial effort over the last few years by a number of research groups, the number of SSRs that are polymorphic 
for A. hypogaea is still limiting for routine application, creating the demand for the discovery of more markers 
polymorphic within cultivated germplasm. 

Findings: A plasmid genomic library enriched for TC/AG repeats was constructed and 1401 clones sequenced. 
From the sequences obtained 146 primer pairs flanking mostly TC microsatellites were developed. The average 
number of repeat motifs amplified was 23. These 146 markers were characterized on 22 genotypes of cultivated 
peanut. In total 78 of the markers were polymorphic within cultivated germplasm. Most of those 78 markers were 
highly informative with an average of 5.4 alleles per locus being amplified. Average gene diversity index (GD) was 
0.6, and 66 markers showed a GD of more than 0.5. Genetic relationship analysis was performed and corroborated 
the current taxonomical classification of A. hypogaea subspecies and varieties. 

Conclusions: The microsatellite markers described here are a useful resource for genetics and genomics in Arachis. 
In particular, the 66 markers that are highly polymorphic in cultivated peanut are a significant step towards routine 
genetic mapping and marker-assisted selection for the crop. 



Background 

Peanut (Arachis hypogaea L.) is an oil crop of great 
importance in the tropics: in Africa, its production is 
comparable to all other grain legumes put together, and 
in Asia it provides about the same number of calories as 
soya (FAO, 2009). It has a narrow genetic base due to 
its recent origin event of tetraploidization [1,2], and this 
has hindered the application of molecular breeding in 
this crop. 

Microsatellites or simple sequence repeats (SSRs) are 
useful molecular markers, are abundant, highly dis- 
persed through the genomes of eukaryotes, and locus 
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specific. In addition they are the ideal markers for geno- 
typing allotetraploid species, such as peanut, since they 
are usually co-dominant and multi-allelic. They are con- 
sidered suitable as tools for genetic diversity studies, 
genetic linkage mapping, and for use in plant breeding 
programs [3]. 

Over the past years several research groups have put 
considerable effort into developing SSR markers for the 
genus Arachis in general and cultivated peanut in parti- 
cular. Now about 5,000 SSR markers have been published 
[4-21]. These markers have been mainly used for diver- 
sity studies of germplasm, and for genetic mapping 
([10,11,22-28]. However, in spite of the number of mar- 
kers available, the very low polymorphism observed 
within cultivated germplasm requires large-scale marker 
screening for the identification of sufficient polymorphic 



O© 2012 Macedo et al; BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons 
BiolVlGCl Centrsl Attribution License (http://creativecommons.Org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in 
any medium, provided the original work is properly cited 



Macedo ef al. BMC Research Notes 201 2, 5:86 
http://www.biomedcentral.eom/1756-0500/5/86 



Page 2 of 10 



markers even for low density genetic maps in populations 
derived from cultivated x cultivated crosses. For example, 
in spite of extensive marker screening, the published 
SSR-based maps of cultivated peanut have only 131, 135 
and 175 SSR markers [22,23,28]. In a previous study, we 
observed that AG/TC microsatellites were more poly- 
morphic than AC/TG ones and that for cultivated germ- 
plasm, the highest polymorphism was observed for 
microsatellites with 21-25 motif repetitions [10]. In this 
context, we isolated and characterized long repeat AG/ 
TC SSRs as an effort to develop markers with high poly- 
morphism levels for cultivated peanut [10]. 

Findings 

Sequencing 

Sequences were obtained for 1401 cloned genomic frag- 
ments. Most fragments were sequenced in both forward 
and reverse orientations. Of these 1401 cloned frag- 
ments, 65 harbored sequences very similar to already 
published markers and so were excluded from further 
analysis (> 50% of sequence with BLAST detected simi- 
larity with E-value < E-40). Of the remaining sequences, 
193 harbored microsatellite repeats. As expected, most 
were TC/AG repeats. The 143 unique SSR sequences 
were deposited in GenBank (accession numbers 
JN887491 to JN887636). 

Design of flanking primer pairs 

Of the 193 selected sequences, 135 were appropriate for 
primer design. Some sequences contained multiple 
microsatellite repeats that could not be flanked by a sin- 
gle primer pair. Therefore, in total 146 primer pairs 
were designed. The microsatellites amplified were gener- 
ally long, the average number of motif repeats being 23. 

Polymorphism levels 

All 146 primer pairs amplified PGR products of the 
expected size. Of these, 85 were polymorphic within the 
tetraploid samples (including cultivated peanut, a syn- 
thetic allotetraploid and an accession of the tetraploid 
wild species, A. monticola (Table 1), and 78 were poly- 
morphic within cultivated germplasm (Table 2). 

The average number of alleles amplified per locus was 
5.5, values of Gene Diversity (GD) were between 0.080 
and 0.885, with an average of 0.614. Sixty-six markers 
were highly polymorphic with a GD of more than or 
equal to 0.5. 

Within cultivated peanut, markers with 21-25 motif 
repetitions were the most polymorphic (69%), followed 
by markers that amplified more than 30 motif repeti- 
tions (60%, most of the markers being composite or 
imperfect) (Figure 1). The lowest polymorphism was 
observed with short microsatellites, between 6-10 motif 
repetitions. 



Genie content 

Thirty-six of the 135 marker sequences encoded puta- 
tive proteins that had significant BLAST similarities to 
known predicted proteins oi Arabidopsis and/or legumes 
(E-value < 1*^'°^ (Additional File 1: Table SI). Of the 
highly polymorphic markers (GD > 0.5), 23% showed a 
significant BLAST similarity. This compares to 35% of 
the markers with GD < 0.5 that do not show significant 
BLAST similarity. 

Genetic relationships 

Genetic similarities were estimated by the band-sharing 
coefficient [29] in pairwise comparisons of the 24 geno- 
types (Table 1), using 78 microsatellite loci. Genetic 
similarity values ranged from 0.42-0.77, considering the 
22 A. hypogaea genotypes used. Therefore all the geno- 
types were differentiated. A dendrogram based on 
UPGMA was constructed for the 24 genotypes (Figure 
2). Cluster analysis showed two main groups according 
to the subspecies. Within these groups, genotypes of the 
same botanical varieties tended to group together. 

Discussion 

In spite of the considerable effort made by several 
research groups to develop molecular markers for culti- 
vated peanut, the number of polymorphic markers avail- 
able for this important crop is still limiting. One of the 
main challenges in the construction of linkage maps 
using populations derived from cultivated x cultivated 
crosses is the need to screen thousands of markers to 
obtain sufficient markers even for the construction of 
low density maps. 

In this study we focused on the class of microsatellites 
that was shown to be the most highly polymorphic for 
cultivated peanut in a previous study, long TC repeats 
[10] For this, sequences were obtained from an enriched 
genomic library. For processing the sequences, the Sta- 
den software was used together with a module for the 
detection of microsatellites. Starting from a relatively 
large dataset of unassembled sequences, it was possible 
to quickly eliminate sequences that were similar to pre- 
viously described markers, and assemble a compact 
database of microsatellite containing sequences. Using a 
naming convention of plasmid clones, it was possible to 
correctly assemble microsatellite-containing reads even 
when the only overlap between forward and reverse 
sequences were microsatellite repeats. This was particu- 
larly important for obtaining complete sequences when 
the repeats were long. For design of primer pairs, the 
program used took into account the quality values of 
consensus bases. This was reflected in the 100% success 
rate of amplification of the primer pairs. 

Markers with 21-25 motif repetitions were the most 
polymorphic, while markers with shorter repeats tended 
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Table 1 Arachis genotypes included In this study. 



Genotype 


Species/Subspecies/Variety 


Origin 


cv. BR 1 


A. hypogoeo subsp. fastigioto var. fastigioto 


Embrapa Cotton, Brazil 


cv. IAC-Caiap6 


A. hypogoeo subsp. hypogoeo var. hypogoeo 


lAL, Sao Paulo, Drazil 


cv. lAL Kunner 886 


A. hypogoeo subsp. hypogoeo var. hypogoeo 


lAL, bao raulo, Drazil 


cv. lAC-Tatu 


A. hypogoeo subsp. fastigioto var. fastigioto 


lAL, bao raulo, orazil 


CV. IAL-d0z4 


A. hypogoeo subsp. hypogoeo var. hypogoeo 


lAL, bao Haulo, Hrazil 


Mf210 


A. hypogoeo subsp. fastigioto var. fastigioto 


Misiones, Argentina 


Mf2352 


A. hypogoeo subsp. fastigioto var. aequotoriona 


Pichlncha, Ecuador 


Mf25 1 7 


A. hypogoeo subsp. fastigioto var. peruviono 


Guayas, Ecuador 


MlZD34 


A. hypogoeo subsp. hypogoeo var. hirsuto 


Pichlncha, Ecuador 


Mtzool 


A. hypogoeo subsp. fastigioto var. vulgoris 


Georgia, USA 


Ml JzO/ 


A. hypogoeo subsp. fastigioto var. vulgoris 


Asuncion, Paraguay 


Mfl625 


A. hypogoeo subsp. fastigioto var. vulgoris 


Minas Gerals, Brazil 


Ml Jol 8 


A. hypogoeo subsp. hypogoeo var. hirsuto 


La Libertad, Peru 


Miz4jU 


A. hypogoeo subsp. fastigioto var. peruviono 


Loja, Ecuador 


Ml J /o4 


A. hypogoeo subsp. fastigioto var. peruviana 


Loreto, Peru 




A. hypogoeo subsp. fastigioto var. aequotoriona 


Lima, Peru 


(VII JOCH- 


/i. I lyfjuyucu buubp. lu^iiyiuiu vdi. ucLjuuiuiiunu 


san h/lar1"in Don i 
Jdll IVldlLIII, rclU 


Mf391 1 


A. hypogaea subsp. hypogaea var. hirsuta 


La Libertad, Peru 


Mf574 


A. hypogaea subsp. hypogaea var. hypogaea 


Santa Cruz, Bolivia 


Mf788 


A. hypogaea subsp. hypogaea var. hypogaea 


Tarija, Bolivia 


Of 106 


A. hypogaea subsp. hypogaea type Xingu 


Xingu Indigenous Park, Brazil 


Of 107 


A. hypogaea subsp. hypogaea type Xingu 


Xingu indigenous Park, Brazil 


VI 41 65 


A. monticola 


Jujuy, Argentina 


{K30076xV14167f^ 


A. ipaensis x A. duranensis 


Embrapa Cenargen, Brazil 



Genotypes of Arachis hypogaea, A. monticola, and a synthetic allotetraploid used for SSR marker validation and for genetic relationship analysis. Arachis hypogaea 
genotypes represent both subspecies {hypogaea and fastigiata) and the six botanical varieties (hypogaea, hirsuta, fastigiata, vulgaris, aequatoriana, and peruviana). 
(K30076XV14167)'"' is a synthetic allotetraploid. lAC is Campinas Agronomic Institute 



to be less polymorphic. This general tendency agrees 
with previous studies and reinforces the view that long 
(21-25 motif repetitions) or composite TC microsatel- 
lites are probably the most polymorphic marker class 
for cultivated peanut. A slightly higher proportion of 
markers that were not polymorphic or less informative 
(GD < 0.5) showed significant similarities to protein 
encoding regions, probably reflecting a tendency for 
non-coding regions to be more polymorphic than cod- 
ing regions. Overall 78 of the markers were polymorphic 
for the cultivated accessions and 66 of these had GD 
value of 0.5 or above. 

Cluster analysis showed two main groups separating 
the two subspecies of A. hypogaea. Some tendency of 
grouping of genotypes according to their botanical vari- 
eties was also evident. The main exceptions were three 
accessions, Mf2517, Mf2352, and Mf2534, which clus- 
tered with no apparent reason. The upper group con- 
tained the five hypogaea I hypogaea genotypes and two of 
the three hypogaea I hirsuta genotypes. Arachis monticola 
and the two genotypes collected in the Xingu 



Indigenous Park also clustered in this group. The Xingu 
material has some morphological traits, especially in the 
pods, exceeding the previously variation described in 
cultivated peanut [30], but it seems to be closely related 
to hypogaea I hypogaea and hypogaea I hirsuta varieties. 
Our results also showed the great genetic similarity of 
the varieties fastigiata and vulgaris, which formed a sub- 
group, and peruviana and aequatoriana, which formed a 
separate subgroup. Some studies have shown that geno- 
types of the varieties peruviana and aequatoriana were 
more closely related to genotypes of the subspecies 
hypogaea than to the other two varieties (fastigiata and 
vulgaris) of snhs^ecisf, fastigiata [8,17,31,32]. Our 
results, in contrast, corroborated the current taxonomi- 
cal classification, despite the small number of genotypes 
included. 

Conclusion 

In this study 146 new microsatellite markers were devel- 
oped for Arachis. All of these markers are new and use- 
ful tools for genetics and genomics in Arachis, but in 
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Table 2 List of the 78 polymorphic markers. 



Table 2 List of the 78 polymorphic markers. (Continued) 



Marker 


Major Allele 


Allele 


Gene 


name 


Frequency 


number 


Diversity 


TCI 3C03 


0.4737 


3 


0.5873 


TCI 3 EOS 


0.4773 


8 


0.71 1 8 




0.5000 


5 


0.6621 


IL14n(J9 


0.9565 


2 


0.0832 


TC15F1 2 


0.31 82 


1 2 


0.8388 


ILloA lua 


0.5786 


6 


0.51 02 


T/~ 1 A 1 A U 
IL lOAlUD 


0.3333 


5 


0.7222 


T/^ T A A A ") 


0.6667 


2 


U.4444 


~T/~ T A A A T U 


0.5238 


4 


0.6270 


~T/~ 1 A D A "7 

ILl 9d(J/ 


0.2500 


8 


0.8472 


TC19E01 


0.6667 


3 


0.5000 


~T/ ) A D A r 


0.4762 


7 


0.7086 




0.4545 


7 


0.6374 


~r/ ) ACAO 


0.4231 


5 


0.7278 


T/ ) 1 A AA 

iCzi Aoy 


0.5435 


4 


0.601 1 


TC21C03 


0.5000 


6 


0.6791 


ILzl UOoa 


0.9286 


2 


0.1 327 


Tf — ) 1 r~iA;; U 

ICzl UOoD 


0.391 3 


3 


0.6578 


TC2 1 GO 1 


0.2368 


6 


0.8075 


TC22B07 


0.9583 


2 


0.0799 


~T/ — ) t A A 


0.81 25 


3 


0.31 1 6 


TC22G05 


0.4750 


5 


0.5700 


TC22H1 2 


0.521 7 


6 


0.6522 


~T/ — n D 1 A 


0.4565 


5 


0.6408 


~T/ ) D A O 


0.81 58 


3 


0.3144 


T"/ ) A O U 


0.9348 


3 


0.1 238 


\LZ5u\j4 


0.3043 


6 


0.7741 




0.3750 


6 


0.7188 


Tl — 1 0 C A A U 


0.5000 


4 


0.5564 


TC23F04 


0.8696 


4 


0.2382 


TC23F09 


0.5000 


3 


0.5450 




0.2273 


1 0 


0.8636 


T/ — ) /I A AiC 

1 Lz4AUo 


0.3214 


7 


0.7985 


Tf — 1 y! D A C 


0.2000 


1 2 


0.8850 


1 L24L(Joa 


0.31 82 


1 0 


0.8068 


~r/ — ) / r~iA;; u 
IL/4UUOD 


0.6042 


5 


0.5460 


TC24D1 2 


0.5500 


3 


0.5950 


TC24E01 


0.9583 


2 


0.0799 


TC24G1 0 


0.5000 


5 


0.6796 


TC25B04 


0.4000 


7 


0.7025 


TC25F03 


0.5833 


5 


0.5747 


TC25Gn 


0.3478 


7 


0.7561 


TC27H12 


0.6042 


5 


0.5868 


TC28A12 


0.2917 


8 


0.8090 


TC28B01 


0.2381 


8 


0.8345 


TC28B07 


0.4750 


8 


0.7213 



TC28E09 


0.8261 


2 


0.2873 


TC29C07b 


0.3913 


5 


0.6909 


TC29H08 


0.5909 


3 


0.5165 


TC30D04 


0.5217 


6 


0.6682 


TC31C09 


0.5417 


3 


0.5799 


TC31Glla 


04000 


8 


0.7663 


TC31Gllb 


0.3750 


4 


0.7049 


TC31H02 


0.3636 


6 


0.7479 


TC31H03 


0.6190 


4 


0.5488 


TC31H06 


04545 


7 


0.7293 


TC34E12 


04737 


4 


0.6482 


TC35F05 


0.3182 


7 


0.7934 


TC36C02 


0.5789 


5 


0.6039 


TC36C03 


0.2778 


6 


0.7778 


TC38A07 


0.5000 


4 


0.5663 


TC38F01 


04375 


7 


0.7144 


TC38H09 


0.2667 


7 


0.8267 


TC39A10 


0.4474 


9 


0.7382 


TC39B04 


0.5000 


4 


0.6400 


TC39C01 


0.5238 


6 


0.6712 


TC39E08 


0.3750 


9 


0.7986 


TC39F01 


0.2353 


8 


0.8443 


TC39F08 


04762 


7 


0.7007 


TC40D04 


0.4667 


5 


0.7022 


TC40E08 


0.9565 


2 


0.0832 


TC41 A05 


0.4706 


6 


0.6990 


TC41A10 


0.5238 


8 


0.6689 


TC41Allb 


0.1875 


10 


0.8750 


TC41C05 


0.9583 


2 


0.0799 


TC41C11 


04773 


6 


0.691 1 


TC42A02 


04583 


3 


0.6424 


TC42A05 


0.5000 


7 


0.6710 


Mean 


0.5056 


5.51 


0.6144 



List of the 78 polymorpliic marl<ers as screened against 22 accessions of A. 
hypogaea. Major allele frequencies, allele number amplified per locus, and 
gene diversity indices are also shown 



particular the set of 66 markers highly polymorphic for 
cultivated peanut are a significant step towards routine 
molecular breeding in this important crop. 

Methods 

Plant material and DNA extraction 

For construction of an SSR-enriched genomic DNA 
library, the peanut genotype A. hypogaea subsp. fasti- 
giata vav. fastigiata cv. lAC-Tatu was used. For marker 
validation and genetic relationship analysis, the follow- 
ing panel was used: a set of 22 A. hypogaea genotypes 
representing all six botanical varieties, a synthetic 
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Number of 
markers 



45 



40 



35 



30 



25 



20 



31K 



50% 



40K 



15 



10 - 



76% 




53% 








60% 


47% 







6-10 



11-15 



16-20 



21-25 



26-30 



>30 



Number of motif repeats 

Figure 1 Frequency of markers detected per repeat size class. Frequency of markers developed in this study, polymorphic (dark grey), and 
monomorphic (light grey) per motif repeat number class. The percentage of polymorphic and monomorphic markers in each class is indicated 
on each bar of the graph. 



allotetraploid (derived from a cross between A. ipaemis 
and A. duranensis) and an accession of the tetraploid 
wild species, A. monticola (Table 1). Marker polymorph- 
ism was also assessed in parents of four mapping popu- 
lations: A. duranensis K7988 x A.stenosperma V10309 
[10,25], A. ipaemis KG30076 x A. magna KG30097 [11], 
A. hypogaea subsp. hypogaea var. hypogaea cv. Runner 
lAC 886, and A. hypogaea subsp. fastigiata var. vulgaris 
cv. Fleur 11 x a synthetic amphidiploid [24] (Additional 
file 1). 

Total genomic DNA was isolated from young leaves 
using the CTAB-based protocol described by Grattapa- 
glia and Sederoff [33] modified by the inclusion of an 
additional precipitation step with 1.2 M NaCl. DNA 
quality and concentration were estimated on agarose gel 
electrophoresis and by spectrophotometry (Genesys 4 - 
Spectronic, Unitech, USA). 

Construction of SSR-enriched library 

A genomic DNA library enriched for the dinucleotide 
repeats TC/AG was constructed as described by Moretz- 
sohn [10]. About nine micrograms of DNA were 
digested with 5flM3AI (Amersham Biosciences, UK) and 



electrophoresed in 0.8% low melting agarose gels to 
select fragments ranging from 200-600 bp. The selected 
fragments were purified from the agarose gels using 
phenol/chloroform, and ligated into Sau?>A\ specific 
adaptors (5'-cagcctagagccgaattcacc-3' and 5'-gatcggt- 
gaaatcggctcaggctg-3'). The ligated fragments were hybri- 
dized to biotinylated (AG)i5 oligonucleotides and 
isolated using streptavidin-coated magnetic beads 
(Dynabeads Streptavidin, Dynal Biotech, Norway). The 
eluted fragments were amplified using one adaptor-spe- 
cific primer, cloned into the pGEM-T Easy vector (Pro- 
mega, WI, USA) and transformed into XLl-Blue E. coli 
cells with blue/white selection (Invitrogen, CA, USA). 
Plasmid DNAs of the positive clones were isolated by 
the alkaline lysis method. Sequencing reactions were 
performed with T7 and SP6 primers and the Big-Dye 
Terminator Cycle Sequencing Kit, version 3.1 (Applied 
Biosystems, CA, USA) using the ABI Prism 377 auto- 
mated DNA sequencer. 

SSR marker development and validation 

Sequences were processed and assembled by using the 
Staden package [34] with the repeat sequence finding 
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I 

0.44 



~l 

0.52 



T" 



~\ 

0.69 



- IAC5024 HH 



-Mf788 HH 



-Runners 86 HH 



- Oiiapo HH 



-Mf3911 HHi 



- Mf574 HH 



-Mf2517FP 



- Amonticola 



-Ofl06HX 



-On07HX 



-Mf3618HHi 



-TATUFF 



-MGIOFF 



-BRl FF 



-Mf3207FV 



-Mfl625 FV 



-Mf2681 FV 



- ME352 FA 



M£2430 FP 



Mf3884FA 



-Mf3892FA 



-MG764FP 



- ME534 HHi 



-K30076xV14167 



~1 

077 



0.60 

Coefficient of Similarity 

Figure 2 Dendrogram based on the band-sharing distances of 24 Arachis genotypes, generated by UPGMA. Tlie letters, after each A. 
hypogaea accession number, refer to the subspecies, varieties, and type: ff-fastigiata/fastigiata; f\/-fastigiata/vuigaris; fA-fastigiata/aequatoriana; 
FP-fastigiata/peruviana; HH-hypogaea/hypogaea; HHi-hypogaea/hirsuta; and HX-hypogaea Xingu type; an accession of A. monticola and a synthetic 
allotetraploid plant (K30076 x VI 41 67)"*'. Note, all accessions were distinguished, the highest point represented on the scale of similarity 
coefficient being 0.77. 
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module TROLL [35] and PrimerS for primer design 
[36], using a module developed by Martins et al. [37]. 
Sequences with more than ten motif repeats were cho- 
sen for primer design. Some sequences with BLASTX 
hits to genes of interest were also included in spite of 
having fewer than ten motif repeats. The parameters for 
primer design were: (1) primer size ranging from 18 bp 
to 25 bp with an optimal length of 20 bp; (2) primer 
Tm (melting temperature) ranging from 57°C to 63°C 
with an optimal temperature of 60°C; and (3) GC con- 
tent ranging from 40%-60%. Default values were used 
for the other parameters. 

PGR reactions contained 10 ng of genomic DNA, 1 U 
of Taq DNA polymerase (Amersham Biosciences, UK), 
Ix PGR buffer (200 mM Tris pH 8.4, 500 mM KGl), 
1.5-2.0 mM MgGla, 200 ^M of each dNTP, and 0.4 ^M 
of each primer, in a final reaction volume of 10 |il. 
Amplifications were carried out in a PTG 100 thermocy- 
cler (MJ Research Inc., MA, USA). PGR conditions were: 
96°G for 5 min, followed by 30 cycles of 94°G for 1 min, 
48-62°G (annealing temperature depending on primer 
pair, see Additional file 1). 

Total genomic DNA was isolated from young leaves 
using the GTAB-based protocol described by Grattapa- 
glia and Sederoff [33] modified by the inclusion of an 
additional precipitation step with 1.2 M NaGl. DNA 
quality and concentration were estimated on agarose gel 
electrophoresis and by spectrophotometry (Genesys 4 - 
Spectronic, Unitech, USA). 

Construction of SSR-enriched library 

A genomic DNA library enriched for the dinucleotide 
repeats TG/AG was constructed as described by Moretz- 
sohn [10]. About nine micrograms of DNA were 
digested with Sau'iAl (Amersham Biosciences, UK) and 
electrophoresed in 0.8% low melting agarose gels to 
select fragments ranging from 200-600 bp. The selected 
fragments were purified from the agarose gels using 
phenol/chloroform, and ligated into Sau3Al specific 
adaptors (5'-cagcctagagccgaattcacc-3' and 5'-gatcggt- 
gaaatcggctcaggctg-3'). The ligated fragments were hybri- 
dized to biotinylated (AG)i5 oligonucleotides and 
isolated using streptavidin-coated magnetic beads 
(Dynabeads Streptavidin, Dynal Biotech, Norway). The 
eluted fragments were amplified using one adaptor-spe- 
cific primer, cloned into the pGEM-T Easy vector (Pro- 
mega, WI, USA) and transformed into XLl-Blue E. coli 
cells with blue/white selection (Invitrogen, GA, USA). 
Plasmid DNAs of the positive clones were isolated by 
the alkaline lysis method. Sequencing reactions were 
performed with T7 and SP6 primers and the Big-Dye 
Terminator Gycle Sequencing Kit, version 3.1 (Applied 
Biosystems, GA, USA) using the ABI Prism 377 auto- 
mated DNA sequencer. 



SSR marker development and validation 

Sequences were processed and assembled by using the 
Staden package [34] with the repeat sequence finding 
module TROLL [35] and Primer3 for primer design 
[36], using a module developed by Martins et al. [37]. 
Sequences with more than ten motif repeats were cho- 
sen for primer design. Some sequences with BLASTX 
hits to genes of interest were also included in spite of 
having fewer than ten motif repeats. The parameters for 
primer design were: (1) primer size ranging from 18 bp 
to 25 bp with an optimal length of 20 bp; (2) primer 
Tm (melting temperature) ranging from 57°G to 63°G 
with an optimal temperature of 60°G; and (3) GG con- 
tent ranging from 40%-60%. Default values were used 
for the other parameters. 

PGR reactions contained 10 ng of genomic DNA, 1 U 
of Taq DNA polymerase (Amersham Biosciences, UK), 
Ix PGR buffer (200 mM Tris pH 8.4, 500 mM KGl), 
1.5-2.0 mM MgGla, 200 \iM of each dNTP, and 0.4 ^M 
of each primer, in a final reaction volume of 10 
Amplifications were carried out in a PTG 100 thermocy- 
cler (MJ Research Inc., MA, USA). PGR conditions were: 
96°G for 5 min, followed by 30 cycles of 94°G for 1 min, 
48-62°G (annealing temperature depending on primer 
pair, see Additional file 1). 

Total genomic DNA was isolated from young leaves 
using the GTAB-based protocol described by Grattapa- 
glia and Sederoff [33] modified by the inclusion of an 
additional precipitation step with 1.2 M NaGl. DNA 
quality and concentration were estimated on agarose gel 
electrophoresis and by spectrophotometry (Genesys 4 - 
Spectronic, Unitech, USA). 

Construction of SSR-enriched library 

A genomic DNA library enriched for the dinucleotide 
repeats TG/AG was constructed as described by Moretz- 
sohn [10]. About nine micrograms of DNA were 
digested with S'flM3AI (Amersham Biosciences, UK) and 
electrophoresed in 0.8% low melting agarose gels to 
select fragments ranging from 200-600 bp. The selected 
fragments were purified from the agarose gels using 
phenol/chloroform, and ligated into Sau3Al specific 
adaptors (5'-cagcctagagccgaattcacc-3' and 5'-gatcggt- 
gaaatcggctcaggctg-3')- The ligated fragments were hybri- 
dized to biotinylated (AG)i5 oligonucleotides and 
isolated using streptavidin-coated magnetic beads 
(Dynabeads Streptavidin, Dynal Biotech, Norway). The 
eluted fragments were amplified using one adaptor-spe- 
cific primer, cloned into the pGEM-T Easy vector (Pro- 
mega, WI, USA) and transformed into XLl-Blue E. coli 
cells with blue/white selection (Invitrogen, GA, USA). 
Plasmid DNAs of the positive clones were isolated by 
the alkaline lysis method. Sequencing reactions were 
performed with T7 and SP6 primers and the Big-Dye 



Macedo ef al. BMC Research Notes 201 2, 5:86 
http://www.biomedcentral.eom/1756-0500/5/86 



Page 8 of 10 



Terminator Cycle Sequencing Kit, version 3.1 (Applied 
Biosystems, CA, USA) using the ABI Prism 377 auto- 
mated DNA sequencer. 

SSR marker development and validation 

Sequences were processed and assembled by using the 
Staden package [34] with the repeat sequence finding 
module TROLL [35] and Primer3 for primer design 
[36], using a module developed by Martins et al. [37]. 
Sequences with more than ten motif repeats were cho- 
sen for primer design. Some sequences with BLASTX 
hits to genes of interest were also included in spite of 
having fewer than ten motif repeats. The parameters for 
primer design were: (1) primer size ranging from 18 bp 
to 25 bp with an optimal length of 20 bp; (2) primer 
Tm (melting temperature) ranging from 57°C to 63°C 
with an optimal temperature of 60°C; and (3) GC con- 
tent ranging from 40%-60%. Default values were used 
for the other parameters. 

PGR reactions contained 10 ng of genomic DNA, 1 U 
of Taq DNA polymerase (Amersham Biosciences, UK), 
Ix PGR buffer (200 mM Tris pH 8.4, 500 mM KGl), 
1.5-2.0 mM MgGla, 200 ^iM of each dNTP, and 0.4 ^M 
of each primer, in a final reaction volume of 10 
Amplifications were carried out in a PTG 100 thermocy- 
cler (MJ Research Inc., MA, USA). PGR conditions were: 
96°G for 5 min, followed by 30 cycles of 94°G for 1 min, 
48-62°G (annealing temperature depending on primer 
pair, see Additional file 1). 

Total genomic DNA was isolated from young leaves 
using the GTAB-based protocol described by Grattapa- 
glia and Sederoff [33] modified by the inclusion of an 
additional precipitation step with 1.2 M NaGl. DNA 
quality and concentration were estimated on agarose gel 
electrophoresis and by spectrophotometry (Genesys 4 - 
Spectronic, Unitech, USA). 

Construction of SSR-enriched library 

A genomic DNA library enriched for the dinucleotide 
repeats TG/AG was constructed as described by Moretz- 
sohn [10]. About nine micrograms of DNA were 
digested with Sau3Al (Amersham Biosciences, UK) and 
electrophoresed in 0.8% low melting agarose gels to 
select fragments ranging from 200-600 bp. The selected 
fragments were purified from the agarose gels using 
phenol/chloroform, and ligated into Sau3Al specific 
adaptors (5'-cagcctagagccgaattcacc-3' and 5'-gatcggt- 
gaaatcggctcaggctg-3'). The ligated fragments were hybri- 
dized to biotinylated (AG)i5 oligonucleotides and 
isolated using streptavidin-coated magnetic beads 
(Dynabeads Streptavidin, Dynal Biotech, Norway). The 
eluted fragments were amplified using one adaptor- 



specific primer, cloned into the pGEM-T Easy vector 
(Promega, WI, USA) and transformed into XL 1 -Blue E. 
coli cells with blue/white selection (Invitrogen, GA, 
USA). Plasmid DNAs of the positive clones were iso- 
lated by the alkaline lysis method. Sequencing reactions 
were performed with T7 and SP6 primers and the Big- 
Dye Terminator Gycle Sequencing Kit, version 3.1 
(Applied Biosystems, GA, USA) using the ABI Prism 
377 automated DNA sequencer. 

SSR marker development and validation 

Sequences were processed and assembled by using the 
Staden package [34] with the repeat sequence finding 
module TROLL [35] and Primer3 for primer design 
[36], using a module developed by Martins et al. [37]. 
Sequences with more than ten motif repeats were cho- 
sen for primer design. Some sequences with BLASTX 
hits to genes of interest were also included in spite of 
having fewer than ten motif repeats. The parameters for 
primer design were: (1) primer size ranging from 18 bp 
to 25 bp with an optimal length of 20 bp; (2) primer 
Tm (melting temperature) ranging from 57°G to 63°G 
with an optimal temperature of 60°G; and (3) GG con- 
tent ranging from 40%-60%. Default values were used 
for the other parameters. 

PGR reactions contained 10 ng of genomic DNA, 1 U 
of Taq DNA polymerase (Amersham Biosciences, UK), 
Ix PGR buffer (200 mM Tris pH 8.4, 500 mM KGl), 
1.5-2.0 mM MgGlz, 200 ^M of each dNTP, and 0.4 pM 
of each primer, in a final reaction volume of 10 [^1. 
Amplifications were carried out in a PTG 100 thermocy- 
cler (MJ Research Inc., MA, USA). PGR conditions were: 
96°G for 5 min, followed by 30 cycles of 94°G for 1 min, 
48-62°G (annealing temperature depending on primer 
pair, see Additional file 1) for 1 min, 72°G for 1 min, 
with a final extension for 10 min at 72°G. PGR products 
were separated by electrophoresis on denaturing polya- 
crylamide gels (6% acrylamide:bisacrylamide 29:1, 5 M 
urea in TBE pH 8.3), stained with silver nitrate [38]. 

Data analyses 

Number of alleles per locus, the range of fragment length 
and gene diversity (GD) were estimated for the poly- 
morphic primers, using the program "Power Marker 
3.25" [39]. Pairwise genetic similarities were estimated 
from the allelic data using the band-sharing coefficient of 
Lynch [29]. The resulting diagonal matrix was then sub- 
mitted to cluster analysis using UPGMA ("unweighted 
pair-group method analysis"). In order to verify the con- 
sistency of the built dendrogram, the cophenetic correla- 
tion - r [40] was calculated. All these analyses were 
performed using the software NTSYS 2.21 [41]. 
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Additional material 



Additional file 1: Table SI. Marker homologies and information The 

file provides information on all markers used in this work: marker name, 
primer sequence, SSR motif, GenBank number, PGR conditions, fragment 
length, sequence homologies and if the primers are polymorphic (P) or 
monomorphic (IVl) for parents of four mapping populations. 
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